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Ontological models are attempts to quantitatively describe the results of a probabilistic theory, 
such as Quantum Mechanics, in a framework exhibiting an explicit realism-based underpinning. 
Unlike either the well known quasi-probability representations, or the "r-p" vector formalism, these 
models are contextual and by definition only involve positive probability distributions (and indicator 
functions). In this article we study how the ontological model formalism can be used to describe 
arbitrary statistics of a system subjected to a finite set of preparations and measurements. We 
present three models which can describe any such empirical data and then discuss how to turn an 
indeterministic model into a deterministic one. This raises the issue of how such models manifest 
contextuality, and we provide an explicit example to demonstrate this. 

In the second half of the paper we consider the issue of finding ontological models with as few 
ontic states as possible. 



PACS numbers: 

I. INTRODUCTION 

Entanglement plays an important role in both quan- 
tum foundations and quantum information. Recently 
however, an overlap between these two fields has emerged 
which goes beyond this somewhat obvious reason for 
common interest. For instance, there is mounting evi- 
dence that contextuality plays a role in any enhancement 
of quantum over classical communication protocols [1]. A 
second example is work on the "r-p" framework for an- 
alyzing the information processing power of general (not 
necessarily quantum) probabilistic theories [2, 3, 4, 5] - a 
body of work to which this paper is closely related. Fi- 
nally we mention work by one of us [6] showing that one 
particular class of ontological models [7, 8] (or 'hidden 
variable theories') of quantum mechanics has a compu- 
tational power slightly greater than that of regular quan- 
tum computation (if access to values of the hidden vari- 
ables is granted). 

As in this last example, we will be considering onto- 
logical models. Such models reproduce the predictions of 
quantum mechanics by using (positive, normalized) prob- 
ability distributions over a space of (sometimes, though 
not always, hidden) variables to describe the state of a 
system. Unlike quantum states in operational quantum 
mechanics, these variables are presumed to correspond 
directly to "properties of reality" in some fashion - hence 
we term the variables the ontic states. Measurements 
in these models are described by positive valued indi- 
cator functions - functions which simply determine the 
probability of a system in a given ontic state yielding the 
particular measurement outcome associated with that in- 
dicator function. In this way ontological models differ 
from the many well known quasi-probabilty representa- 
tions of quantum mechanics. In those representations, 
even if the states are represented by positive, normal- 
ized probability distributions (e.g. the Q-function), the 
measurements will be represented by indicator functions 



which can be negative (or greater than 1). Such negativ- 
ity is difficult to interpret if one is interested in describing 
properties of a reality underpinning quantum mechanics. 
In fact Spekkcns has shown that such negativity arises in 
these representations precisely because they are assumed 
non-contextual [9]. In particular, this assumption of non- 
contcxtuality is made in the work on the r-p formalism 
mentioned above, and is the reason the r-vectors (those 
vectors associated with measurements) generally have to 
contain negative elements. Similar considerations apply 
to the work on discrete Wigner distributions [10]. 

A comprehensive discussion of the ontological model 
framework was undertaken in [7]. For our purposes we 
will not need the majority of the formalism developed in 
that paper. However, let us mention in passing a few 
of the conclusions from it. Firstly, there exist a wide 
variety of ontological models, both deterministic and in- 
deterministic, including ones which, unlike the examples 
given in [6] , do not provide greater computational power 
than that of regular quantum computation. Secondly, 
for some models it is necessary to consider not just ontic 
states of the system under consideration, but also those 
of the preparation and measurement devices. Thirdly, 
the manner in which such models exhibit contextuality 
is varied, but one feature (which in some sense subsumes 
contextuality) that is common to all models is a property 
we term deficiency. Loosely speaking, deficiency breaks 
a symmetry between preparations and measurements in 
quantum mechanics. It is the property that the set of on- 
tic states which a system prepared in quantum state 
may actually be in, is strictly smaller than the set of on- 
tic states which would reveal the measurement outcome 
with certainty. 

In this paper, we initiate an investigation into how on- 
tological models can be used to reproduce quantum me- 
chanical statistical predictions for a discrete set of prepa- 
rations of, and measurements upon, a given system. We 
start in section II by discussing a method of representing 
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empirical data in matrix form. In Section III we then in- 
troduce a matrix factorization of this data, equivalent to 
an ontological model over a finite number of ontic states. 
In Sections IIIB, IIIC and HID we present three fac- 
torizations which can describe any such empirical data. 
We then discuss in section III E how to turn an indeter- 
ministic ontological model into a deterministic one. This 
raises the issue of how such models manifest the neces- 
sary contcxtuality, and in Section IIIF we discuss this 
and provide an explicit example. 

In the second half of the paper, Section IV, we consider 
the issue of finding ontological models with as few ontic 
states as possible - a topic with potential application to 
the classical simulation of quantum systems. 



II. DATA TABLES 

We begin with a common [2, 3, 4, 5], although some- 
what idealistic, formalization of the process of perform- 
ing an experiment resulting in probabilistic data - data 
for which we then seek an explanation. The experiment 
consists of a set of preparation procedures, and a set of 
measurement procedures [26]. The preparation proce- 
dures are defined in terms of different macroscopic (and 
therefore distinguishable) configurations (settings) of an 
apparatus, and these are labeled PW, where i = 1,2, ..s 
(s standing for "states"). Likewise the different mea- 
surement procedures are macroscopically distinguishable, 
and can be labeled where i = 1,2, ..m. Each mea- 

surement procedure has some number d of outcomes, 
which, by padding with null outcomes if necessary, we 
take to be the same for each. We can then label the 
occurrence of the fc th outcome, when the measurement 
procedure is as M% . 

We imagine that a large number of experiments arc 
performed, and the probabilities that a given prepara- 
tion yields a given measurement outcome estimated and 
tabulated, e.g: 
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In this example s = 4, m = 2 and d = 3, so that there 
are dm rows in the matrix. The sum of every d elements 
within each column must be 1, since one of the outcomes 
must obtain for every measurement. For clarity we have 
drawn a horizontal line underscoring the separation be- 
tween distinct measurements. 

We will call such a matrix of values a data table, and 
generally denote it D. It will often be convenient to con- 
sider D a three-dimensional array, where the particular 



measurement being performed lies along the third dimen- 
sion. For the above example we can do so by defining 
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More generally the data table can be taken as specified 

(x) 

in terms of D\ \. where x = 1, 2, ... to, i = 1,2, ... a, 
i = l,2,...s. 

Now in quantum mechanics we generally associate a 
density matrix pk with a preparation V^ k \ and a POVM 

(x) • (x) 

element E\ with a measurement outcome M\ . Since 
we will frequently be considering data tables for which 
the table entries are presumed to be given by quantum 
mechanical expectations - i.e. Tr(phE^ x '), it is often con- 
venient to label the rows and columns via the appropri- 
ate quantum operators. In fact, we will almost always 
only be considering procedures which correspond to pure 
quantum states \tpk) and to sets of sharp (projective) 
measurements {IT} (also known as PVM's). In this case 
we have 
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and it will be convenient to adopt a more quantum no- 
tation along the lines of: 
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At this stage it is pertinent to consider the following: 

Proposition 1 Every possible data table can be realized 
via standard quantum expectation values of projective 
measurements on pure states. 

At first sight this proposition may appear incorrect. Con- 
sider, for example, the following data table: 
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As there are two possible outcomes for each measure- 
ment, one might surmise that the data was generated by 
sharp measurements on a qubit prepared in cither \ipi) 
or \ipi). In that case the table is clearly impossible - 
the first measurement implies the two states are orthog- 
onal, and the second implies there is a two outcome mea- 
surement which yields one of the states with certainty, 
but which is unbiased with respect to the second state. 
However, consider if the system was quantum mechani- 
cally actually a three dimensional qutrit with basis states 
|0), |1), |2). In this case it is easy to verify that choosing 
IV'i) = |0), \4>2) = the two sets of measurements could 
be {|0)(0| + |2)(2|, |1)<1|} and {|0)<0| + |+)(+|, |-)(-|}, 
where |±) = (|1) ± \2))/y/2. That is, the states are sharp 
and the measurements are projective. The point is that 
there is no way of determining from the raw data table 
itself any presumed (quantum) Hilbert space dimension 
of the systems under investigation. We will use the gen- 
eral notation cLq to denote the Hilbert space dimension 
of such a quantum representation of a data table - for 
this example cLq = 3. 

Proposition 1 asserts that every data table can be so 
constructed - that is, by pure quantum states and pro- 
jective operators. That this is possible can be seen as 
follows. Since we are not restricted in g?q, let us imagine 
that every preparation procedure (column of the data ta- 
ble) is represented by a state which is orthogonal to all 
the other states. That is, each state lies in its own sub- 
space of the total Hilbert space. Clearly, if we can choose 
measurements so as to reproduce any desired statistics 
for one such state then, since the states are all orthog- 
onal, we can piece together measurement operators for 
the different states in a direct sum to produce suitable 
measurement operators for the whole table. Focussing on 
one state then, imagine the measurement under consid- 
eration has outcomes occurring with probabilities pi. All 
we need to establish is that, for a given there is a way 
of choosing projection operators such that (V>|IIi|i/>) = pi. 
One way of constructing suitable operators is to work in 
the basis where = [1, 0, 0, . . ] T . Let v be the row 
vector [^/pi, \/P2, ■ ■ ■] 1 and construct the unitary matrix 
which has v as its first row and the remaining d — 1 rows 
any orthogonal basis of the support of I — v T v. Then 
the columns of this unitary matrix form a set of or- 
thogonal states, and projectors onto these states satisfy 

(VlllilV) =Pi- 

This procedure is extremely wasteful in terms of d,Q, 
the Hilbert space dimension used to represent generic 
data. This raises the following: 

Open Problem 1 Is there an efficient procedure for 
finding the smallest Hilbert space dimension required to 
represent an arbitrary data table in terms of pure states 
and projective measurements ? 

Although we do not have an answer to this problem, we 
note that being able to solve it would allow one to (ap- 
proximately) solve a quantum one way communication 
complexity problem. Thus Open Problem 1 may bene- 



fit from studies in this field. To see the relation, note 
that by sending K copies of systems prepared accord- 
ing to a procedure V^ l \ a party Alice can have another 
party Bob estimate the probability of the k th outcome 
of some measurement M.^' occurring. The fixed accu- 
racy of Bob's estimation is determined by the size of K. 
Thus, if the systems realizing the preparations and mea- 
surements consist of n qubits then Alice needs to send 
Bob nK qubits for him to obtain his estimation of the 
desired probability. But suppose that we consider 
and to be specified by binary strings x% and yj and 

denote the probability that Bob is attempting to esti- 
mate by f(xi,yj). Then requiring Bob to perform the 
estimation of f {x%,yj) with Alice sending as few qubits 
as possible is a quantum one way communication com- 
plexity problem. Thus the efficient procedure required by 
Open Problem 1 would allow us to minimize n and thus 
(given that K is a constant overhead determined only by 
the accuracy of Bob's estimation) allow us to solve the 
associated quantum one way communication complexity 
problem. 



The lesson of Proposition 1 is that, at its most basic 
level, the probabilistic structure of quantum mechanics is 
without physical content. To obtain such content extra 
assumptions or restrictions must be imposed. For exam- 
ple, assuming that measurements on distinct systems are 
described by separable (tensor products of) projection 
operators, the Hilbert space structure we build to yield 
D cannot be guaranteed to reproduce all possible corre- 
lated probability distributions between multiple systems 
[27]. 



These precautions are well known but bear repeating, 
because in this paper we will be looking at represent- 
ing a data table via a different probabilistic formalism, a 
formalism essentially that of classical probability theory 
It is well understood that the ontological model formal- 
ism allows for an arbitrary data table to be represented 
- therefore our goal is to investigate more than simply 
whether such models can reproduce empirical observa- 
tions. In fact we are primarily interested in using these 
models to obtain a deeper understanding of the restric- 
tions which occur on any proposed realistic explanation 
for quantum mechanics. As such we will be looking at 
the structure of such models when they are used to model 
data tables corresponding to hypothetical experiments 
which in quantum mechanics would be described by pure 
state preparations and rank one projective measurements 
on a system of a known Hilbert space dimension. There- 
fore we will generally have the case that g?q = d, and 
Open Problem 1 will not be of concern to us. 
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III. ONTOLOGICAL FACTORIZATION OF A 
DATA TABLE 

A. Denning Ontological Factorizations 

Our primary goal is to examine factorizations of the 
data matrix D^ x \ x = 1,2, . ..m into the product of a 
d x 17 measurement matrix and an 17 x s preparation 
matrix P (both positive-real valued): 

D (x) =M (x) Pj Vx = l,2,...m. (1) 

The columns of the preparation matrix P are normal- 
ized probability distributions i.e. P is column-stochastic; 
it satisfies < Pjk < 1, and J2k Pjk = 1 f° r au • The 
interpretation of P is that its fc th column is a (classi- 
cal) probability distribution, over 17 distinct ontic states, 
which corresponds to the preparation procedure of 
D. That is, we imagine that when a system is prepared 
according to V^ k \ it really is prepared in one of the 17 
ontic states, and our ignorance of which particular ontic 
state the system is in is represented by the probability 
distribution in column k of P. 

Each row of the measurement matrix corresponds to an 
indicator function over the 17 ontic states. That is, 
is the probability of obtaining measurement outcome i, 
if the actual ontic state of the system is j, given that 
measurement x is being performed. 

Besides (1) there is one extra constraint on M, due to 
our desire to be able to interpret an ontological factoriza- 
tion as a model of the probabilistic data arising from a 
realistic framework. Specifically, regardless of which on- 
tic state a system is in, one of the measurement outcomes 
must pertain. For instance, if we consider the x th mea- 
surement performed on a system actually in ontic state 
j, then one of the d outcomes must occur. As such, we 
need, 

»,/ I- (2) 

i=i 

for each of the j = 1 ... 17. Thus the are also 

necessarily column-stochastic. This is actually an im- 
portant extra constraint - without it one can reproduce 
quantum statistics with non-contextual ontological mod- 
els not having negative indicator functions - which has 
been shown not to be possible by Spekkens [8]. (They 
would have a decidedly weird ontology however, as they 
would require a strange non-separable conspiracy be- 
tween preparation and measurement devices.) 

With the above constraints in mind, we can define an 
ontological factorization more formally as: 

Definition 1 A data table D possesses an ontological 
factorization ( OF) over £7 ontic states if there exists an 
17 x s, column- stochastic matrix P , and d matrices 
each m x 17 dimensional and column- stochastic, which 
satisfy L>0) = M^P for all x= 1,2,... d. 



If the entries of M are all or 1 , then the ontological 
factorization is deterministic, otherwise it is indetermin- 
istic. 

B. Model 1: An indeterministic ontological 
factorization 

We now briefly present a trivial OF, which is easily 
performed for any data table. This will be rendered much 
less trivial in subsection III E. Specifically, in this OF of 
the data table D we choose 

M (x) = D (x) and p = L 

This OF has 17 = s. That is, the number of ontic states is 
equal to the number of preparation procedures (quantum 
states) used in constructing the data table. In terms of 
an ontological model, this OF implies that preparing a 
system in a quantum state \ipj) is equivalent to preparing 
the system in a single (determined) ontic state (since each 
column of P has only one non-zero entry). In the termi- 
nology of [11] this model is V'-ontic (and ■(/'-complete). 

Measurements in this model are represented by inde- 
terministic indicator functions - the model simply speci- 
fies what the probability is that a system in a given ontic 
state yields each outcome of any specified measurement. 

This ontological model, when extrapolated to a "con- 
tinuum limit" (i.e. constructed for a data table of all 
measurements and preparation procedures for a quantum 
system of some fixed Hilbcrt space dimension), becomes 
the model of Beltrammati-Bugajski [12], which was pre- 
sented as example number 1 in [7]. In that model the 
ontological state space is simply the complex projective 
space, the quantum state is represented by a Dirac delta 
function distribution, and the indicator functions over 
the space are indeterministic, with probabilistic weights 
defined by the Born rule. 

Model 1 formally encapsulates the commonly ex- 
pressed viewpoint that the quantum state is the state 
of reality. 

C. Model 2: A deterministic ontological 
factorization 

We now turn to a deterministic OF which can be con- 
structed for any data table. This OF has 17 = d m ontic 
states. It is convenient to index the ontic states (that 
is, the rows of P and columns of M) by an m-tuple of 
integers of the form . . . ,j m ) where each of the j x 

ranges over 1,2, ... ,d. We then assign probabilities over 
the ontic states to create P as follows: 

m 
x=l 

Alternatively we could specify that the fc th column of 
P, denoted P-^, is formed from the k th column of D as 
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follows: 



P,k 



x=l 



Consider summing over index j\ 



E ^(31,32,-, 



3m),k ~ 11 
jl z=l 



(x) 



(3) 



since £)W is column-stochastic. We see that in general 
summing over an index simply removes all terms from 
the product which contain that index. Summing over all 
row indices j x in this way we confirm that P is column- 
stochastic as required. 

We now consider how to construct row i of M^), which 

(x) 

corresponds to the indicator function for II) . We assign 
the value 1 to every entry in this row for which the col- 
umn index (j 1 ,j 2 , ■■■,3m) has j x = i. More formally, 

M { f - . ,=6n . 

1,(31,32 Jm) 

To see that this works as desired, take, for example, i = 1, 
so we are dealing with the first row of M^ x > . We compute 
the inner product of this row with the matrix P: 

E M h(h ,h ,-,j m ) % 



jl, — Jm 

E P(l,32,...j m ),k 



< e n D . 

fa,.. .,3m x = 2 
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i,fe- 



where we have used the observation in Eq. (3) multiple 
times. Thus we see that we reproduce the required statis- 
tics of the data table. 

An example may help illustrate the idea. Consider the 
following data table with d = 3 and m = 2: 



D = 
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1/3 1/3 
2/3 
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1/3 1/2 
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(4) 



Following the construction above, the matrices M and P 
are given by 
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There arc two points 
Firstly, unlike Model 1, 



(5) 

to make about this model, 
the columns of the matrix 
P from Model 2 are not orthogonal (disjoint). That 
is, the probability distributions corresponding to non- 
orthogonal states overlap. Such models are called tjj- 
epistemic in [11]. In such models, knowing the ontic state 
of the system does not allow one to infer with certainty 
what quantum state would describe the same prepara- 
tion. Unfortunately, this is only true in the case of a finite 
data table, m < oo. In the continuum limit of this model, 
the probability distributions of non-orthogonal states be- 
come disjoint. To prove this rigorously requires a diver- 
sion into the infinite tensor product structures first con- 
sidered by von Neumann [13], and we will not do so here. 
We point out, however, that even though it is "0-ontic in 
the continuum limit, it is not, in the terminology of [11], 
■0-completc. (A ^-complete model would have only one 
ontic state consistent with any given quantum state.) 

Finding models which arc -0-cpistemic in the contin- 
uum limit is not easy, some attempts were given in [14], 
and we believe Barrett has made some recent progress 
[9] . Because there is a continuum of quantum states, one 
way to ensure an ontological model representation of all 
states and measurements in a Hilbert space of dimension 
dq is V'-cpistcmic is simply to find a model which makes 
use of only a finite number of ontic states. Hardy [15] 
has shown that this is not possible - as the number of 
states and measurements increases, the number of ontic 
states must increase. However, Hardy's analysis relies 
only on the fact that we must be able to divide the on- 
tic state space into disjoint sets (to represent orthogonal 
measurements), and that as we go to the continuum limit 
there are an infinite number of orthogonal measurements 
that need to be represented in this manner [28] . Now the 
number of ways of dividing £1 ontic states into disjoint 
sets grows exponentially. It would therefore seem that 
the possibility remains open that the number of ontic 
states required for an indeterministic model might grow 
only logarithmically as the size of the data table. 

But in fact this is not possible. To see this, con- 
sider the simplest case of OF's of data tables having 
d = 2, which we can show must satisfy the tight bound 
fl = min{2 m ,s}. Clearly Models 1 and 2 set an upper 
bound of f2 < min {2™, s} on the required number of on- 
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tic states, by providing deterministic and indetcrministic 
factorizations having fl = s and £1 = 2 m respectively. To 
give the tight bound claimed, it remains to show that for 
given s and m there always exists a data table requiring 
an ontological factorization having > 2 m if 2 m < s 
or £1 > s if s < 2 m . We will give a general construc- 
tion for a data table that lower bounds CI and then show 
how it applies in the case of each of these inequalities 
between s and m. First note that we can write the data 
tables we consider as including only the probabilities for 
the first outcome of each measurement (since d = 2 the 
probabilities of the second outcomes are then trivially de- 
termined). Thus we can consider a data table with some 
parameters rh and s having columns made up from all 
m-bit binary strings. For example, if rh — 3 and s — 8 
then the data table thus constructed would be, 
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(6) 

But an OF of any data table of this form must employ 
at least 2 m ontic states. This follows since the outcomes 
of measurements M.^\ and Ai^ are completely 

determined given any of the 8 preparations shown (i.e. 
the relevant entries of D are all 0/1); therefore any ontic 
states in the supports of these preparations (i.e. assigned 
a non-zero value in the relevant column of an OF's P 
matrix) must be deterministic (take 0/1 values) with re- 
spect to these three measurements. Thus in the example 
given in (6), we must use at least 8 ontic states to deal 
with the 8 distinct possibilities for assigning outcomes 
to M (1 \ M {2) and . Clearly, by the same reason- 
ing, such a data table constructed with 2 m preparations 
will require > 2™. Similarly, any data tables having 
general parameters s and m and containing sfifr-tables 
of the form (6) will require OFs with at least 2™ ontic 
states. Thus it only remains to determine the largest 
such sub-table - i.e. largest value of rh - that can fit 
in data tables with parameters subjected to each of the 
constraints 2 m < s and 2 m > s. If we have 2 m < s then 
we can find a sub-table of the form (6) having rh = m 
and s = s, giving O > 2 m = min{2 m ,s}. If however, 
2™ > s, then the largest sub-table of the form (6) that 
we can find will have rh = log 2 (s) and s = s, yield- 
ing f2 > s — min{2 m , s}. Thus in both cases the lower 
bound fl > min{2 m ,s} holds. Combining this with the 
trivial upper bound provided by Models 1 and 2 gives the 
tight bound £1 = min{2 m , s}, showing that as the size of 
a data table is increased, the number of ontic states must 
grow faster than a logarithmic dependence on the param- 
eters s and m. The general question of how many ontic 
states are required to represent any particular given data 
table is taken up in Section IV. 

The second point to make about Model 2 is that the 
number of ontic states grows exponentially with the num- 



ber of measurements being performed. However, this 
need not always be the case, as can be seen for mod- 
els in which depends on m by invoking Caratheodory's 
theorem to yield a lower bound on f2 [29]. In the next 
section we explicitly demonstrate such a lower bound by 
introducing our final model; a deterministic ontological 
factorization having = s (dm — 1) [30]. 

D. Model 3: A deterministic ontological 
factorization with fl = O(poly(m)) 

Our final ontological factorization is tailored to employ 
a number of ontic states that does not increase exponen- 
tially with the number of measurements to be described. 
This model is best illustrated by an example of its con- 
struction. Consider the data table from (4) (the method 
we outline is trivially adapted to an arbitrary data table). 
We deal with each column of D separately. Suppose that 
we are attempting to reproduce the second column of D, 
corresponding to some preparation procedure V^; 

- 2 " 

3 
1 
3 
_0_ 

1 
2 
1 
2 

. . 

We begin by finding the smallest entry in the chosen 
column, which in this case takes a value of | , and is asso- 
ciated with the second outcome of the first measurement. 
We then move this entry into another column vector, to 
be added to what remains of the original. We also re- 
move the same value of | from one of the second column 
entries associated with the remaining measurement - re- 
stricting our choice only to ensure that the chosen entry 
is greater than |. Referring to the second column of D 
shown above, we see that there are two such entries to 
choose from, associated with the first and second out- 
comes of the second measurement and both taking the 
value | > i. We choose the entry associated with the 
first outcome, and thus obtain; 
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The new binary column vector we have generated pro- 
vides a valid set of deterministic measurement outcomes 
for an ontic state (weighted by i). In particular, note 
that by subtracting the same value from a single entry 
associated with each measurement we have ensured that 
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the binary vector satisfies the stochasticity requirement 
of (2). We can repeatedly apply this procedure to the 
'remainder' vector [|, 0, 0, |, |, 0] on the right hand side 
of the above expression, until only column vectors with 
binary entries remain. This gives; 



often allows us to exceed this value, so long as the pro- 
cedure is applied carefully, exploiting repeated entries in 
D to reduce the number of ontic states we need to intro- 
duce. By this token the example given above has = 6 
- almost half of s(dm — 1) = 10. 
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The three 0/1 vectors generated by this procedure can 
be used as valid columns of an M matrix, and we can 
take the associated probabilistic weightings to form a P 
matrix row corresponding to the preparation V^ 1 '. Thus 
by introducing three ontic states we have formed P and 
M matrices that can reproduce the the quantum statis- 
tics for all measurements from D performed on the single 
preparation V^ 2 \ Repeating this process to introduce a 
new set of ontic states for the remaining column of P 
(corresponding to a preparation V^), yields the follow- 
ing deterministic ontological factorization, 



M 



1 1 
10 10 
110 



10 110 
1 1 
1 



P 



Note that since we introduce a whole new set of on- 
tic states for each preparation, the entries of different 
columns of our P matrix are bound to be completely dis- 
joint. Thus, following the terminology of [11], Model 3 
can be classified as ^-ontic. 

The procedure outlined above will generally require in- 
troducing dm — 1 ontic states for each preparation from 
D, which suggests Jl = s(dm — 1). However, Model 3 



E. Deterministic ontological factorizations from 
indeterministic ones 



In this section we will examine a process for turning 
the indeterministic OF of Model 1 into a deterministic 
OF. First a point of notation: from now on we will often 
use the two dimensional array versions of the data table 
and measurement matrices: 









1 M« \ 


D = 




, M = 






£)(«») 







so that the OF becomes simply D = MP. 

The simplest version of the procedure we outline for re- 
covering determinism from indeterminism is most easily 
illustrated by an example. Consider the following data 
table: 



D = 



1/2 2/3 
1/2 1/3 



3/4 
1/4 




The OF of model 1 would have M — D and 



P = 



That is, SI = 2. The first step is to notice that each of the 
probabilities in the first column of D is an integer multi- 
ple of 1/4, while each in the second column is an integer 
multiple of 1/3. We therefore expand the supports of the 
probability distributions in P from single ontic states to 
4 and 3 ontic states respectively: 



P 



( V 4 


\ 


1/4 





1/4 





1/4 








1/3 





1/3 




1/3 J 



Note that the probability distributions are still disjoint 
(in the language of [11] the model is V'-ontic but not ip- 
complete). We now want to choose a matrix M with 
only 0/1 entries. If we choose the first row of M (i.e. the 
indicator function for the first measurement outcome) to 
be 



110 110 



then we will certainly produce the first row of D. The k( 
is that the disjointness of the two columns of P enabl 
us to assign l's and O's to M's first row in an essential 
unrestricted fashion - the first 4 elements of this row sir 
ply need to pick up enough 1/4's from P to give the enti 
1/2 for tpi, and similarly the last three entries must pi< 
up an appropriate number of the 1/3's to give the e: 
try 2/3 for ip2- Assigning the second row we need to t 
more careful. One assignment which naively would see 
to work, in as much as it would reproduce the data tab 
entries when multiplied by P, is 

( 1 1 1 ) ■ 

However this violates the constraint of colum: 
stochasticity on the M^'s from Definition 1. Such £ 
indicator function would imply that a system in ont 
state 1 gives both the first measurement outcome ar 
the second outcome with certainty! The only choice for 
this row once we have chosen the first row as above is 

( 1 1 1 ) ■ 

Proceeding in this manner, one possibility for M is 

/l 1 1 i o\ 

1 1 1 

1110 111 

\ 1 0/ 

Imagine now that the data table only contains rational 
fractions as entries. This would certainly be the case if 
we were considering a table which was assembled from ac- 
tual experimental data. Clearly, generalizing the above 
procedure will allow us to find M and P matrices so 
as to obtain a deterministic OF. This does not give us 
a procedure for obtaining a finite OF for any data ta- 
ble however, since we may wish to consider ones con- 
structed from quantum states and projectors which have 
irrational valued overlaps. For the general program of 
research into ontological models there is no requirement 
of finitencss - in particular the trick of spreading the sup- 
port of the probability distribution over extra ontic states 
can readily be performed by spreading the distribution 
over variables of cardinality 2 N ° . This yields a procedure 
for transforming any indeterministic ontological model 
into a deterministic one - and is essentially what Bell did 
[16] (sec example 4 in [7]) to provide a counterexample to 
'von-Neumann's silly assumption'. In doing so one has 
increased the number of ontic states required to describe 
the system - the extent to which this is necessary, as 
opposed to merely sufficient, is an interesting question, 
which to some extent we take up again in Section IV. 

In the last four sections we have derived a series of 
bounds on the number of ontic states that ontological fac- 
torizations must employ. By a simple extension of the ar- 
gument given in Sec. Ill C we have that data tables with 
arbitrary parameters s,d,m require a number of ontic 
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FIG. 1: (Color online) The data table of quantum mechanical 
statistical predictions for the Kernaghan set of 11 measure- 
ments and 20 states. The values of the entries are color coded 
as black, green, red and white for 1, 1/2, 1/4 and respec- 
tively. The blue lines separate different 4-outcome PVMs. 

states that is tightly bounded by = min{d m , s}. How- 
ever, the explicit construction that we gave in Sec. Ill D 
showed that by letting f2 depend on all three param- 
eters of a data table, the number of ontic states need 
not depend exponentially on m. These bounds clearly 
have interesting repercussions for the ability to classi- 
cally simulate quantum systems efficiently. For exam- 
ple, our lower bound f2 > min{d m ,s} shows that one 
cannot precisely reproduce a general data tables quan- 
tum statistics by keeping track of a number of ontic 
states that grows only logarithmically with the number 
of preparations and measurements considered. Extend- 
ing and refining the bounds we have presented is an in- 
teresting direction for future research, which may bene- 
fit from existing results in the literature. For example, 
Aaronson has shown in [17] that the probabilities for a 
set of m two-outcome measurements performed on an ./V 
dimensional quantum system can be approximately cal- 
culated (to some fixed accuracy) from a classical string of 
length O(log(A01oglog(A01og(m)). This shows that a 
data table containing m such measurements and some 
set of preparations associated with an N dimensional 
quantum system has an approximate ontological factor- 
ization that employs Q = 2°( lo sW i°gi°sW lo s( m )) = 
TO O(iog(A0io g iog(iV)) ontic states ( w herein each ontic state 
encodes a possible instance of the string one can con- 
struct to encode the required statistics). Thus, in cases 
where one is satisfied with approximately reproducing the 
entries of a data table, the results in [17] give a lower 
bound on £1 which depends pseudo-polynomially on m 
and N, but interestingly not on s (the precise number of 
preparations in the data table). 

In Sec. IV D we will also note how existing results con- 
cerning matrix factorization problems might prove useful 
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in deriving more bounds on Q. 



F. Contextuality in ontological factorizations 

By the famous result of Kochen and Specker [18] de- 
terministic ontological models must be measurement- 
outcome contextual. We now consider how measurement 
outcome contextuality manifests itself in an OF of a data 
table. Contextuality needs to be considered when differ- 
ent quantum measurements share a common projector. 
In such data table might look something like 

(with d = 3): 



" 0.08 


1 





G7 





09" 


0.51 













0.41 













0.08 


1 





67 





09 


0.92 













. 0.00 














(7) 



Rows 1 and 4 contain the same entries, and so in quan- 
tum mechanics they could correspond to measurement 
outcomes represented by the same projector. Initially 
one might have been tempted to simply exclude the re- 
dundant rows. The essence of the Kochen-Specker the- 
orem is that doing so would prevent us being able to 
explain the data table in terms of a deterministic onto- 
logical model, and so we retain them. It is at this point 
that we depart from the r-p formalism. In the r-p formal- 
ism there is a unique r-vector for each quantum projector. 
As mentioned above, this presumption of measurement 
outcome non-contextuality is the reason the r- vectors re- 
quire negative entries (or entries greater than 1) [31] . 

In terms of our OF's, the contextuality required by the 
Kochen-Specker theorem manifests itself as follows: If 
two rows of the data table are identical, then one cannot 
necessarily find an OF such that the two corresponding 
rows of M are identical. Now, if the only constraint on 
the factorization was that D = MP with M a 0/1 ma- 
trix, then this would not be true, since there would be 
nothing stopping the replacement of one of those rows of 
M by the other. The key issue, as mentioned above, is 
the extra column-stochasticity constraint on the factor- 
ization, which forces at least one measurement outcome 
to be obtained for any given ontic state. 

Let us demonstrate all this with an example. In [19] 
Kernaghan provided an example of a set of measurements 
in a 4 dimensional Hilbert space for which a Kochen- 
Specker obstruction exists (see also [20]). These measure- 
ments are built from the following 20 states in a Hilbert 
space of dimension 4 (the normalization factors of 1/ ^/2 



or 1/2 arc omitted, and 1 denotes — 1): 



I0i) 
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1). 


1015) = 
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0), 
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1), 
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(1 
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hM 


= (1 1 
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1018 ) = 
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= (1 
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I0io) 


= (0 1 
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020) = 


(1 
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1). 



The 11 sets of 4-outcome measurements which lead to a 
Kochen-Specker obstruction can be tabulated as follows: 

/ 1 1 1 1 17 17 18 19 15 6 9 \ 

2 2 3 4 18 19 19 20 16 7 8 (g) 

3 5 8 14 19 9 14 5 11 11 13 

\ 4 6 10 15 20 10 16 7 12 13 12 / 

So, for example, the third measurement II^ 3 ) consists of 
the projectors {|0i)(0i|, |03)(03|, |08)(08|, |0io)(0io|}- 
The proof that these projectors cannot be assigned a 
unique truth value (i.e. independent of the other pro- 
jectors in the measurement) is trivial. In each of the odd 
number (i.e. 11) of measurements, only one projector can 
(and must) be assigned the truth value 1. This is in di- 
rect conflict with the fact that each of the |0i)'s appears 
an even number of times in (8). 

The data table for these 11 measurements and the 20 
states is depicted in a color coded fashion in Figure 1. 
Applying the procedure of section III E allows us to gen- 
erate matrices M and P for a deterministic OF over 
20 x 4 = 80 ontic states. The matrices for this OF are 
depicted in Figure 2. 

An example of the contextuality of this OF can be 
seen by considering rows 3 and 10, which correspond to 
the same measurement projector |03)(03| (these rows are 
marked on M in Figure 2 by red arrows). Clearly each 
of these indicator functions must have a 1 at all of the 
set of ontic states which \ip 3 ) has support over (c.f. the 
third column in P - that is, ontic states 10 through 13). 
We see, however, that there are some ontic states (71, 
75 and 79 for example) that do not lie in the support 
of |03)), and to which one of rows 3 or 10 assigns a 1 
while the other assigns a 0. Ontic states such as 71, 75 
and 79 also show that there are some ontic states which 
are assigned a value 1 by the row in M corresponding to 
103) (03 1 but not by the column in P corresponding to 
|03). As discussed in [7] this is a general (necessary) fea- 
ture of all ontological models: there are necessarily some 
ontic states which pass a test |0)(0| for a system "being 
in" a state which can never be prepared when the 
preparation procedure specifies the quantum state to be 
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FIG. 2: (Color online) P and M matrix for a deterministic OF of the Kernaghan set of preparations and measurements, 
black squares denote a value of 1, the red of 1/4 whilst the white squares represent 0. 



The 



This asymmetry of deterministic ontological mod- 
els between preparations and measurements, which is not 
present in quantum mechanics, was termed deficiency in 
[7]. The terminology comes from the fact that the sup- 
port of the probability distribution corresponding to the 
quantum state is strictly less than the support of the in- 
dicator function corresponding to a projection onto that 
state, as we have seen. We term unfaithful those ontic 
states which do not lie in the support of the probability 
distribution, but which do pass the test (since for some 
measurements they will "choose" to give a different out- 
come). Note that if an ontological model is therefore to 
be able to describe preparing the state by performing 
a measurement of \tp) it is going to be the case that a 
system in an unfaithful ontic state must be disturbed by 
the measurement in order to end up in one of the faithful 
states. 



IV. ONTOLOGICAL COMPRESSION 

So far we have investigated OF's without particular 
regard to how large the number of ontic states £1 is re- 
quired to be. In this section we consider the problem of 
trying to cither find OF's with f2 as small as possible, 
or to reduce the value of il for a given OF - a process 
we call "ontological compression" . Such a reduction in 



ontic states clearly has repercussions for the ability to 
efficiently simulate quantum systems classically - a point 
we elaborate on in Sec. IV C. But, as we shall see, our 
ontological compression schemes also provide routes to- 
wards constructing ■f/'-epistemic models in which we can 
truly view the quantum description of a system as a state 
of knowledge. 

Models 1 and 2 gave OF's which require 57 = s and 
fi = d m respectively - these values set upper bounds on 
£1 for indctcrministic and deterministic OF's respectively. 
But since deterministic ontological factorizations can be 
seen as a special case of indeterministic ones then Model 2 
also provides an upper bound of f2 = d m for indetermin- 
istic factorizations. There are therefore upper bounds 
on f2 for both deterministic and indeterministic models 
which depend only on a data tables measurements, and 
upper bounds for indctcrministic models which depend 
on only s. One may therefore wonder whether an upper 
bound can be found for deterministic models which de- 
pends only on s. But this is not possible; a deterministic 
ontological factorization always requires at least dm + 1 
ontic states. 

To see why, note that each ontic state, Aj, from a de- 
terministic model is associated with a length dm binary 
string, given by the i th column of the models M ma- 
trix. Thus the ontic states of any deterministic ontologi- 
cal factorization can be thought of as column vectors, Ai, 
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defining vertices on a unit hypcrcubc of dimension dm; 
€ dm C [0, l] dm . Now consider how such a model repro- 
duces the k th column of the data table D (corresponding 
to the measurement statistics for some preparation pW) 
in terms of these \i column vectors. Writing the relevant 
column of D as a vector p^ k ^ € [0,1] m containing the 
outcome probabilities for each measurement, its entries 
are reproduced by a convex combination of the A^, 



(9) 



Where we use to denote the i th element of the k th 
column of the OFs P matrix. Thus deterministic models 
can be thought of as reproducing D by convexly summing 
over a set of VL points in a dm dimensional space. But at 
least dm + 1 such points must be convexly summed for us 
to be able to represent an arbitrary point from [0, 1] m . 
Thus we require at least dm + 1 columns of M, i.e. at 
least dm + 1 ontic states. 

We can also set a trivial lower bound on SI: 

Proposition 2 Any ontological factorization must be 
such that 51 > rank(_D). 

This follows immediately from the fact that D = MP, 
the ranks of M, P cannot exceed Jl, and for general ma- 
trices rank(AB) < min(rank(A), rank(£?)). 

As an example of where this bound can be attained in 
a deterministic OF, consider the data table: 



D 



1 1/2 1/2 1/2 1/2 
1 1/2 1/2 1/2 1/2 



1/2 1/2 1 
1/2 1/2 



1/2 1/2 

1 1/2 1/2 



1/2 1/2 1/2 1/2 1 
1/2 1/2 1/2 1/2 1 



(10) 



This is the data table for a qubit, where the measure- 
ments are the projectors onto the eigenbases of the Pauli 
X, Y, Z matrices, and the states are simply the eigen- 
states of these three operators. This data table has the 
following OF: 



M 



(I 
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0\ 








1 1 


1 





1 
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1 
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Vo 


1 


1 0/ 



p 



/ 1 1 1 \ 

1 10 10 1 

2 1 1 1 

\ o i o i i o / 



(11) 



This OF has fl = 4, which is less than both s = 6 or 
d m = 8, and which happens to be the rank of D. (Note 
that this OF is equivalent to a single toy-bit of Spckkcns 
[I]-) 



A. Ontological compression: Method 1 

The first technique for ontological compression we con- 
sider can be applied to any OF, but finds particular util- 
ity in compressing the deterministic OF's generated from 
Model 1 (discussed in Section HIE). The key is to con- 
sider the possibility that two columns j and f of M, cor- 
responding to ontic states associated with distinct and 
non-orthogonal quantum states, are identical. If this is 
the case, then one of the ontic states is redundant, since 
any row of P (i.e. any quantum state) which has support 
at j, could just as well reproduce the relevant statistics 
in D by instead having support at j'. 

For example, consider the data table given in (10) 
and suppose that we started with a deterministic un- 
compressed OF of the kind discussed in section HIE so 
that, 



M 



1 1 






1 1 
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1 
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1 1 






1 1 
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1 
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1 1 






1 1 



(12) 



P 



( 1 \ 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 



V 



(13) 



/ 



We can immediately see that several of the columns 
of (12) are already equal (specifically, columns 1 and 5, 
2 and 7, 3 and 12 and 4 and 10). We can therefore 
compress the pairs of associated ontic states together. In 
general however we may not be so fortunate, and we will 
need to manipulate the columns of M to try and make 
as many identical as possible. So long as we respect the 
constraints imposed by stochasticity and the requirement 
MP = D we are free to alter the entries of M as is 
convenient. In particular, we can change which ontic 
states a given indicator function (i.e. row of M) uses 
to reproduce any quantum states statistics within D so 
long as we respect the requirement that one and only one 
outcome of each PVM should ever occur. Consider the 
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2x2 boxes we have drawn on M in (12). These partition 
rows corresponding to distinct PVM measurements and 
sets of ontic states (columns) associated with different 
preparations [32]. In terms of these boxes, the allowed 
manipulations of M correspond to flipping the columns 
of a box, 



a b 
c d 



b a 
d c 



As an example of how one can use this freedom to com- 
press two ontic states, columns 3 and 6 of (12) can be 
made equal by simply applying this permutation to the 
bottom box containing columns 5 and 6 (shown in red). 
There is however, a subtlety in forcing columns of M to 
be equal in this way. The permutation we apply to make 
columns 3 and 6 equal will of course also alter column 
5. But initially this column was already equal to column 
1 . To retain this initial equality we must apply the same 
permutation to the bottom box containing columns 1 and 
2 (shown in blue). However, this in turn will alter col- 
umn 2, which was initially identical to column 7 and so 
to retain this equality we must then permute the bottom 
box containing columns 7 and 8 (shown in green). Per- 
forming these three permutations we obtain the following 
new expression for M, 



M = 
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100110011100 
\o 1 1 1 1 1 l/ 



(14) 



Combined with (13) this reproduces the statistics from 
D. Counting identical columns we find that in fact there 
are now 8 redundant columns in M, three more having 
appeared due to the extra reshuffling we needed to per- 
form in order to share columns 3 and 6. These redundant 
columns can be removed, which results in us compress- 
ing the four triples (1,5,9), (2,7,11), (3,6,12), (4,8,10) 
into four ontic states. We can then finally update P to 
represent this compression by combining its rows corre- 
sponding to each of the four triples of ontic states. The 
resultant compressed OF is in fact precisely the optimally 
compressed model given in (11). 

In cases with larger data tables, the compression is not 
quite so trivial. The Kernaghan OF for example, requires 
one to keep track of eighty ontic states. By applying an 
algorithm for randomly combining ontic states according 
to a natural generalization of the example given above, 
we are able to compress the Kernaghan OF to use only 42 
ontic states. This is much lower than the d m ontic states 
needed by Model 2 and almost half the number required 
by our deterministic rendering of Model 1. The result- 
ing M and P matrices are shown in Figure 3. Clearly 



the columns of the matrix P associated with this com- 
pressed OF are not disjoint, and consequently the OF is 
"0-epistemic. In fact, the approach to compression out- 
lined above will always result in a V'-epistemic model. 
It should be noted however, that the compression illus- 
trated in Figure 3 is not guaranteed to be optimal, since 
it does not saturate the bound given in Proposition 2. In 
general, one would expect the amount by which one can 
compress an OF using this approach to depend on the 
initially chosen M (picked at random for the above ex- 
amples) and the order in which one chooses to compress 
the various ontic states of the model. Optimizing these 
choices is highly non-trivial. 



B. Ontological compression: Method 2 

The second method for ontological compression we 
consider is designed to effect a compression for OF's 
based on Model 2. It is also most easily illustrated by 
an example. Let us return to the example of Eqs. (4) 
and (5), reproduced here: 
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It is convenient to view the 9 entries corresponding to 
each of the two columns of P as probability distributions 
over 9 "ontic state boxes" as follows: 
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, P^ = 
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The indicator functions arc all of the form 



1 








1 








1 
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1 





















etc. 



That is, the construction of Model 2 is such that summing 
V^ k ' along a column of the boxes yields the probability 
for the corresponding outcome of the first measurement, 
summing along a row yields the same for the second mea- 
surement. Thus it is clear that if we leave the row and col- 
umn box-sums invariant we can "shift around" the prob- 
ability weightings that the distributions p^\p^ assign 
to the ontic states. If we then find that both distributions 
assign a weighting to an ontic state, it can be removed 
completely and some ontological compression has been 
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FIG. 3: (Color online) Compressed P and M matrix for a deterministic OF of the Kernaghan set of preparations and mea- 
surements. Color coding as in Figure 2. Note that there is now some overlap in the ontic states associated with different 
preparations (cf. the different columns of P). 



achieved. In fact, we already see that Vgl = Vfl = 0, 
and so this ontic state could be removed. 

In order to shift around the probability, one simple op- 
eration involving 4 ontic states is to map them as follows: 



Pi,3 - 


- o, 




T>ij> - 


- Pij' 


+" Pi,j, 


'/v., - 


~* Pi', 3 


+" Pi,jj 


Pi',j' 


~* Pi', 3' 


~Pi,3 



Since the final state must be a suitable probability dis- 
tribution, we require 7fy,j' > Pi,j- In our example we 
can use this to implement: 



p% - 


-> 


v m _ 


-> o, 


r® - 


-> 3/9 


7?C2) _ 

' 2,1 


-1/2, 


v w _ 

r 3,2 


■> 3/9 


1 1,2 


-1/3, 


1 3,3 


-> 1/9 


1 1,1 


-1/6. 



In this manner we nullify weighting of both distribu- 
tions on the middle ontic state (i.e. V^l=Q) , and it too 
could be removed: 



tables involved. Why might one be interested in onto- 
logical compression? Consider a "family" of data tables 
- that is a set of data tables constructed from an in- 
creasing number of states and measurements. An ex- 
ample might be data tables constructed from the family 
of all stabilizer states on rt-qubits, for increasing values 
of n. If it were the case that this data table can be 
ontologically compressed such that O only grows poly- 
normally with n, then an efficient classical simulation of 
that family of states and measurements is clearly possi- 
ble. If we denote by D n the data table for all n-qubit 
stabilizer states/measurements so that D„ has 0(4") 
rows/columns (and D\ would then be the data table of 
(10)), then we are asking whether there exists an ontolog- 
ical factorization of D n which has = (poly (n)). But 
we can immediately see that this cannot be achieved, be- 
cause any table containing all stabilizer states and mea- 
surements will contain a sub-table in which (for d = 2) 2™ 
preparations are associated with all n bit strings over n 
measurements - i.e. a sub-table of the form shown in (6). 
Since we have already seen that such a table will require 
Q > 2™, then its existence clearly prohibits us from find- 
ing ontological factorization of the stabilizer data table 
having = (poly (n)). 






1/9 


2/9 




1/6 


1/3 








■ 


3/9 


, = 


1/2 


■ 





■ 


3/9 


1/9 




■ 









(Where ■ denotes entries to be deleted.) 



C. Classical simulation and ontological 
compression 

We have seen that some sort of ontological compres- 
sion may be possible, depending on specifics of the data 



Actually, a polynomial sized number of ontic states is 
too strong a requirement for efficiency: Classical Monte- 
Carlo simulations, for example, can be provably effi- 
cient for certain problems even when the number of ontic 
states grows exponentially. Since our ontological models 
are also essentially classical probability representations, 
we similarly expect a polynomial requirement to be too 
strong for them. 
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D. Connections to other matrix factorization 
problems 

We conclude this section with a brief discussion of how 
finding OF's is related to other matrix factorization prob- 
lems. Our data tables are generally not square. However, 
in instances where (such as in the example of Eqs. (10) 
and (11)) we use the same quantum states to specify both 
the preparation procedures and the measurements, then 
D is symmetric, with non-negative entries, and positive 
semi-definite (ic doubly non- negative) . These conditions 
are necessary (though not sufficient) for the matrix D 
to be completely positive - that is, factorizable into a 
product of the form A = B^B, where the elements of 
B are non-negative. Completely positive matrices have 
been much studied (see e.g. [22]), in particular with re- 
gards to finding (bounds on) the smallest row-dimension 
of B for which this factorization is possible (called the 
"cp-rank" of B, or sometimes the "factorization index" ) . 

A completely positive matrix factorization (i.e. one 
of the form D = B^B) is not generally equivalent to 
our OF's. However, we have already seen an example 
of such a factorization appearing as an OF - the matri- 
ces M, P of the example in Eqs. (10) and (11) differ 
only by an overall multiplicative factor of 1/2, and so we 
can see this example as providing a completely-positive 
matrix factorization of this particular D. Now it is in- 
teresting to note that there is an upper bound [33] on 
the cp-rank [23] which is polynomial in rank(D), which, 
in turn, is polynomial in d,Q. The cp-rank is, in the OF 
picture, the number of ontic states Q. Thus it seems 
plausible there exist some interesting data table families 
which have polynomial (in (Iq) sized OFs. 

There is a second type of matrix factorization which 
has some connection to the OFs we have been consid- 
ering: the so-called "non-negative matrix factorization" 
(NMF). In this factorization a matrix of (non- negative- 
valued) image data V is factorized into the product of 
two matrices W, H with non-negative elements: 

V = WH. 

NMF is often considered from the viewpoint of image 
compression, where the goal is to obtain an approximate 
factorization where W (H) have as small a number of 
rows (columns) as possible. Some particularly simple pro- 
cedures for performing this approximate factorization, 
guaranteed to converge to a local optima (with respect to 



various matrix distances between the ideal V and the ap- 
proximate one) , were given in [24] . It was shown recently 
in [25] that if one considers optimizing with respect to the 
Kullback-Lcibler distance, then the local optima preserve 
the row and column sums of the original matrix. In par- 
ticular, if we consider the separate NMF factorizations 

D (x) = tf (i) Pi 

so is column-stochastic, then the results of Corol- 
lary 2 in [25] imply that the NMF found will be such 
that M^ X \P are column stochastic as we require. Of 
course our problem requires finding many such factor- 
izations with the same P for the m different D^'s. 
Interestingly, one reason that NMF is useful in image 
analysis is that it breaks the image up into "hidden 
variables" (Lee and Seung's terminology, not ours!) such 
that an image is comprised of pieces which humans 
recognise as familiar fundamental components (the ears, 
nose and mouth of a facial image for example). This is 
unlike the more standard principal component analysis 
route to "eigenimage" compression. In these terms the 
OF we are considering is asking an interesting question 
regarding factorizing many different images (the D^)'s) 
using the same set of fundamental component images 
(the P). 



V. CONCLUSIONS 

By using an ontological formalism to describe finite 
sets of preparations and measurements performed on a 
quantum system we have illustrated how many features 
of ontological models are manifested in such discretized 
scenarios. We have shown how contextuality and defi- 
ciency appear in this formalism and have also been able 
to use the formalism to build a ^-epistemic model of a 
discrete set of quantum statistics. We have also seen how 
any indeterministic description of a finite data table can 
be developed into an equally valid deterministic one and 
noted that the same technique (first employed by Bell) 
can be applied in the continuum limit. We have also 
discussed how OFs can be used to compress the number 
of ontic states needed to describe such discretized quan- 
tum scenarios and speculated on repercussions which this 
might have for classical simulations of quantum systems. 
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